Robust probabilistic TDOA estimation in reverberant environments

نویسنده

  • Steven J. Rennie
چکیده

In this paper, a novel Expectation-Maximization algorithm for estimating the time-delay-of-arrival of multiple non-stationary sound sources in non-stationary reverberative acoustic environments, is presented. Motivated by the success of the phase-transform/histogram based approaches of Aarabi [2, 3], the algorithm operates by learning a probabilistic relationship between the latent TDOAs and the observed microphone phase over a small collection of short-time DFTs, and in the process automatically estimates the TDOA posterior over the collection of DFTs, of each individual DFT, and also provides a measure of the frequency content of each sound source. Experimental results demonstrate that the algorithm performs as well as the Histogram techniques of Aarabi [2, 3], which have demonstrated until now unmatched results for the problem of acoustic TDOA estimation in natural environments. The model is generative an parametric and thus can potentially be seamlessly fused with probabilistic descriptions of speech production and mixing such as defined in [10], to achieve enhanced speech separation capability.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A robust TDOA estimation method for in-car-noise environments

Time difference of arrival (TDOA) estimation is one of the key techniques in array signal processing and has wide applications in hands-free speech interface. TDOA is used to derive the time difference of signal propagation from source to the spatially separated microphones. The acoustic transfer functions ratio (ATF-s ratio) method is one of the robust approaches to get TDOA estimate in presen...

متن کامل

Multi-source TDOA estimation in reverberant audio using angular spectra and clustering

We consider the problem of estimating the time differences of arrival (TDOAs) of multiple sources from a two-channel reverberant audio signal. While several clustering-based or angular spectrum-based methods have been proposed in the literature, only relatively small-scale experimental evaluations restricted to either category of methods have been carried out so far. We design and conduct the f...

متن کامل

Weighted Spatial Covariance Matrix Estimation for MUSIC Based TDOA Estimation of Speech Source

We study the estimation of time difference of arrival (TDOA) under noisy and reverberant conditions. Conventional TDOA estimation methods such as MUltiple SIgnal Classification (MUSIC) are not robust to noise and reverberation due to the distortion in the spatial covariance matrix (SCM). To address this issue, this paper proposes a robust SCM estimation method, called weighted SCM (WSCM). In th...

متن کامل

Multi-source localization in reverberant environments by ROOT-MUSIC and clustering

Localization of acoustic sources in reverberant environments by microphone arrays remains a challenging task in audio signal processing. As a matter of fact, most assumptions of commonly adopted models are not met in real applications. Moreover, in practical systems it is not convenient or possible to employ sophisticated and costly architectures, that require precise synchronization and fast d...

متن کامل

Performance Improvement of TDOA-Based Speaker Localization in Joint Noisy and Reverberant Conditions

TDOA(time difference of arrival-) based algorithms are common methods for speech source localization. The generalized cross correlation (GCC) method is the most important approach for estimating TDOA between microphone pairs. The performance of this method significantly degrades in the presence of noise and reverberation. This paper addresses the problem of 3D localization in joint noisy and re...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008